Search CORE

153 research outputs found

Online approximations for wind-field models

Author: A. Stoffelen
A. Stoffelen
C. M. Bishop
D. J. Evans
D. Offiler
G. Kimeldorf
I. T. Nabney
M. Opper
N. A. Cressie
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2001
Field of study

We study online approximations to Gaussian process models for spatially distributed systems. We apply our method to the prediction of wind fields over the ocean surface from scatterometer data. Our approach combines a sequential update of a Gaussian approximation to the posterior with a sparse representation that allows to treat problems with a large number of observations

Crossref

Aston Publications Explorer

Modelling Issues in Kernel Ridge Regression

Author: A B Kock
A J Smola
B Sch�lkopf
D S Broomhead
G C Cawley
G S Kimeldorf
H Raiffa
H White
J H Stock
J H Stock
J Mercer
K Yao
M C Medeiros
N Aronszajn
P Exterkate
Peter Exterkate
S Bochner
S C Ludvigson
S C Ludvigson
T Hofmann
T Poggio
T Ter�svirta
T Ter�svirta
Publication venue: 'Elsevier BV'
Publication date: 01/01/2011
Field of study

Crossref

Predicting complex traits using a diffusion kernel on genetic markers with an application to dairy cattle and wheat data

Author: AE Hoerl
AJ Lorenz
AJ Smola
C Saunders
CR Henderson
D Gianola
D Gianola
D Gianola
D Gianola
D Gianola
D Gianola
D Habier
Daniel Gianola
F Fouss
G de los Campos
G de los Campos
G Kimeldorf
G Kimeldorf
G Morota
Gota Morota
Guilherme J M Rosa
H Shao
I Strandén
IR Kondor
J Crossa
J Lafferty
J Yang
JP Vert
Kent A Weigel
L Loewe
L Xu
LC Evans
M Gönen
Masanori Koyama
N Long
N Long
O González-Recio
O González-Recio
PM VanRaden
S Tsuruta
SVN Vishwanathan
T Gärtner
TFC Mackay
TH Meuwissen
THE Meuwissen
U Ober
U Ober
Z Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Estimation and testing for the effect of a genetic pathway on a disease outcome using logistic kernel machine regression via logistic mixed models

Author: A Subramanian
B Schölkopf
D Eisenberg
D Liu
D Zhang
Dawei Liu
Debashis Ghosh
G Kimeldorf
JJ Goeman
JJ Goeman
JJ Goeman
KD Dahlquist
M Raponi
N Breslow
P Grosu
P McCullagh
R Davies
R Davies
S Dhanasekaran
S le Cessie
SG Self
SW Doniger
V Vapnik
Xihong Lin
Z Wei
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Growing interest on biological pathways has called for new statistical methods for modeling and testing a genetic pathway effect on a health outcome. The fact that genes within a pathway tend to interact with each other and relate to the outcome in a complicated way makes nonparametric methods more desirable. The kernel machine method provides a convenient, powerful and unified method for multi-dimensional parametric and nonparametric modeling of the pathway effect. Results In this paper we propose a logistic kernel machine regression model for binary outcomes. This model relates the disease risk to covariates parametrically, and to genes within a genetic pathway parametrically or nonparametrically using kernel machines. The nonparametric genetic pathway effect allows for possible interactions among the genes within the same pathway and a complicated relationship of the genetic pathway and the outcome. We show that kernel machine estimation of the model components can be formulated using a logistic mixed model. Estimation hence can proceed within a mixed model framework using standard statistical software. A score test based on a Gaussian process approximation is developed to test for the genetic pathway effect. The methods are illustrated using a prostate cancer data set and evaluated using simulations. An extension to continuous and discrete outcomes using generalized kernel machine models and its connection with generalized linear mixed models is discussed. Conclusion Logistic kernel machine regression and its extension generalized kernel machine regression provide a novel and flexible statistical tool for modeling pathway effects on discrete and continuous outcomes. Their close connection to mixed models and attractive performance make them have promising wide applications in bioinformatics and other biomedical areas.</p

Crossref

Harvard University - DASH

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Collection Of Biostatistics Research Archive

Harvard Dataverse Network

Collapsing-based and kernel-based single-gene analyses applied to Genetic Analysis Workshop 17 mini-exome data

Author: 1000 Genomes Project Consortium
AP Morris
B Li
BE Madsen
D Liu
D Liu
D Zhang
G Kimeldorf
Hongyu Zhao
John Ferguson
Joon Sang Lee
LA Almasy
LC Kwee
Lun Li
M Choi
MC Wu
R McPherson
RH Duerr
Wei Zheng
Wellcome Trust Case Control Consortium
Xianghua Zhang
Xiting Yan
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Recently there has been great interest in identifying rare variants associated with common diseases. We apply several collapsing-based and kernel-based single-gene association tests to Genetic Analysis Workshop 17 (GAW17) rare variant association data with unrelated individuals without knowledge of the simulation model. We also implement modified versions of these methods using additional information, such as minor allele frequency (MAF) and functional annotation. For each of four given traits provided in GAW17, we use the Bayesian mixed-effects model to estimate the phenotypic variance explained by the given environmental and genotypic data and to infer an individual-specific genetic effect to use directly in single-gene association tests. After obtaining information on the GAW17 simulation model, we compare the performance of all methods and examine the top genes identified by those methods. We find that collapsing-based methods with weights based on MAFs are sensitive to the “lower MAF, larger effect size” assumption, whereas kernel-based methods are more robust when this assumption is violated. In addition, many false-positive genes identified by multiple methods often contain variants with exactly the same genotype distribution as the causal variants used in the simulation model. When the sample size is much smaller than the number of rare variants, it is more likely that causal and noncausal variants will share the same or similar genotype distribution. This likely contributes to the low power and large number of false-positive results of all methods in detecting causal variants associated with disease in the GAW17 data set

Crossref

Springer - Publisher Connector

PubMed Central

Bayesian classification of tumours by using gene expression data

Author: Albert J.
Aronszajn N.
Bernardo J. M.
Bishop C.
Cristianini N.
Denison D.
Figueiredo M.
Gelfand A.
Gelfand A.
Gelman A.
Hastie T. J.
Herbrich R.
Holmes C.
Kimeldorf G.
MacKay D.
Moler E. J.
Neal R.
Parzen E.
Pontil M.
Ripley B. D.
Rosenblatt F.
Schena M.
Scholkopf B.
Sollich P.
Specht D. F.
Tipping M.
Vapnik V. N.
Wahba G.
Wahba G.
Wahba G.
West M.
Williams C.
Xiong M.
Zhu J.
Publication venue: 'Wiley'
Publication date: 01/04/2005
Field of study

Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/75678/1/j.1467-9868.2005.00498.x.pd

Crossref

Research Papers in Economics

Deep Blue Documents at the University of Michigan

Simultaneous model-based clustering and visualization in the Fisher discriminative subspace

Author: A. Jain
A. Montanari
A. Raftery
C. Biernacki
C. Biernacki
C. Bishop
C. Bouveyron
C. Fraley
C. Maugis
Camille Brunet
Charles Bouveyron
D. Foley
D. Rubin
D. Scott
D.A. Clausi
E. Anderson
E. Tipping
G. Celeux
G. Celeux
G. Golub
G. Kimeldorf
G. McLachlan
G. McLachlan
G. McLachlan
G. Schwarz
H. Akaike
I. Jolliffe
J. Baek
J. Friedman
J. Ye
J. Ye
K. Fukunaga
K. Liu
L. Parsons
M. Law
N. Campbell
N. Trendafilov
P. Howland
P. McNicholas
R. Agrawal
R. Bellman
R. Duda
R. Fisher
S. Boutemedjet
T. Alexandrov
T. Hastie
T. Hastie
W. Krzanowski
Y. Hamamoto
Y.F. Guo
Z. Jin
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 19/04/2011
Field of study

Clustering in high-dimensional spaces is nowadays a recurrent problem in many scientific domains but remains a difficult task from both the clustering accuracy and the result understanding points of view. This paper presents a discriminative latent mixture (DLM) model which fits the data in a latent orthonormal discriminative subspace with an intrinsic dimension lower than the dimension of the original space. By constraining model parameters within and between groups, a family of 12 parsimonious DLM models is exhibited which allows to fit onto various situations. An estimation algorithm, called the Fisher-EM algorithm, is also proposed for estimating both the mixture parameters and the discriminative subspace. Experiments on simulated and real datasets show that the proposed approach performs better than existing clustering methods while providing a useful representation of the clustered data. The method is as well applied to the clustering of mass spectrometry data

arXiv.org e-Print Archive

HAL Evry

Crossref

HAL-Paris1

Genome-assisted prediction of a quantitative trait measured in parents and progeny: application to food conversion rate in chickens

Author: A Legarra
A Legarra
Andreas Kranis
B Efron
B Efron
C Andreescu
CJF ter Braak
CR Henderson
D Gianola
D Gianola
D Gianola
DA Sorensen
Daniel Gianola
F Guillaume
G Kimeldorf
G Wahba
G Wahba
G Wahba
GK Wong
Guilherme JM Rosa
I Misztal
J Dekkers
JL Hirschhorn
Kent A Weigel
L Janss
L Varona
L Wasserman
LG Gaya
LR Schaeffer
N Long
O González-Recio
Oscar González-Recio
R Fernando
R Tibshirani
RAE Pym
S Xu
T Park
THE Meuwissen
W Zhang
WJ Ewens
WM Muir
X Ye
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Accuracy of prediction of yet-to-be observed phenotypes for food conversion rate (FCR) in broilers was studied in a genome-assisted selection context. Data consisted of FCR measured on the progeny of 394 sires with SNP information. A Bayesian regression model (Bayes A) and a semi-parametric approach (Reproducing kernel Hilbert Spaces regression, RKHS) using all available SNPs (p = 3481) were compared with a standard linear model in which future performance was predicted using pedigree indexes in the absence of genomic data. The RKHS regression was also tested on several sets of pre-selected SNPs (p = 400) using alternative measures of the information gain provided by the SNPs. All analyses were performed using 333 genotyped sires as training set, and predictions were made on 61 birds as testing set, which were sons of sires in the training set. Accuracy of prediction was measured as the Spearman correlation (r¯S) between observed and predicted phenotype, with its confidence interval assessed through a bootstrap approach. A large improvement of genome-assisted prediction (up to an almost 4-fold increase in accuracy) was found relative to pedigree index. Bayes A and RKHS regression were equally accurate (r¯S = 0.27) when all 3481 SNPs were included in the model. However, RKHS with 400 pre-selected informative SNPs was more accurate than Bayes A with all SNPs

Crossref

Directory of Open Access Journals

PubMed Central

Edinburgh Research Explorer